NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Automatic Input Rewriting Improves Translation with Large Language Models

Ki, Dayeon; Carpuat, Marine (February 2025, ArXiv)

Full Text Available
Automatic Input Rewriting Improves Translation with Large Language Models

https://doi.org/10.18653/v1/2025.naacl-long.542

Ki, Dayeon; Carpuat, Marine (January 2025, Association for Computational Linguistics)

Full Text Available
Sustaining Human Agency, Attending to Its Cost: An Investigation into Generative AI Design for Non-Native Speakers' Language Use

https://doi.org/10.1145/3706598.3713626

Xiao, Yimin; Hancock, Cartor; Agrawal, Sweta; Mehandru, Nikita; Salehi, Niloufar; Carpuat, Marine; Gao, Ge (April 2025, ACM)

Full Text Available
Sustaining Human Agency, Attending to Its Cost: An Investigation into Generative AI Design for Non-Native Speakers' Language Use

Xiao, Yimin; Hancock, Cartor; Agrawal, Sweta; Mehandru, Nikita; Salehi, Niloufar; Carpuat, Marine; Gao, Ge (April 2025, arXiv)

Full Text Available
Guiding Large Language Models to Post-Edit Machine Translation with Error Annotations

Ki, Dayeon; Carpuat, Marine; Duh, Kevin; Gomez, Helena; Bethard, Steven (June 2024, Association for Computational Linguistics)

Machine Translation (MT) remains one of the last NLP tasks where large language models (LLMs) have not yet replaced dedicated supervised systems. This work exploits the complementary strengths of LLMs and supervised MT by guiding LLMs to automatically post-edit MT with external feedback on its quality, derived from Multidimensional Quality Metric (MQM) annotations. Working with LLaMA-2 models, we consider prompting strategies varying the nature of feedback provided and then fine-tune the LLM to improve its ability to exploit the provided guidance. Through experiments on Chinese-English, English-German, and English-Russian MQM data, we demonstrate that prompting LLMs to post-edit MT improves TER, BLEU and COMET scores, although the benefits of fine-grained feedback are not clear. Fine-tuning helps integrate fine-grained feedback more effectively and further improves translation quality based on both automatic and human evaluation.
more » « less
Full Text Available
Guiding Large Language Models to Post-Edit Machine Translation with Error Annotations

https://doi.org/10.18653/v1/2024.findings-naacl.265

Ki, Dayeon; Carpuat, Marine (January 2024, Association for Computational Linguistics)

Full Text Available
Do Text Simplification Systems Preserve Meaning? A Human Evaluation via Reading Comprehension

https://doi.org/10.1162/tacl_a_00653

Agrawal, Sweta; Carpuat, Marine (January 2024, Transactions of the Association for Computational Linguistics)

Abstract Automatic text simplification (TS) aims to automate the process of rewriting text to make it easier for people to read. A pre-requisite for TS to be useful is that it should convey information that is consistent with the meaning of the original text. However, current TS evaluation protocols assess system outputs for simplicity and meaning preservation without regard for the document context in which output sentences occur and for how people understand them. In this work, we introduce a human evaluation framework to assess whether simplified texts preserve meaning using reading comprehension questions. With this framework, we conduct a thorough human evaluation of texts by humans and by nine automatic systems. Supervised systems that leverage pre-training knowledge achieve the highest scores on the reading comprehension tasks among the automatic controllable TS systems. However, even the best-performing supervised system struggles with at least 14% of the questions, marking them as “unanswerable” based on simplified content. We further investigate how existing TS evaluation metrics and automatic question-answering systems approximate the human judgments we obtained.
more » « less
Full Text Available
Explaining with Contrastive Phrasal Highlighting: A Case Study in Assisting Humans to Detect Translation Differences

https://doi.org/10.18653/v1/2023.emnlp-main.690

Briakou, Eleftheria; Goyal, Navita; Carpuat, Marine (December 2023, Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing)

Explainable NLP techniques primarily explain by answering “Which tokens in the input are responsible for this prediction?”. We argue that for NLP models that make predictions by comparing two input texts, it is more useful to explain by answering “What differences between the two inputs explain this prediction?”. We introduce a technique to generate contrastive phrasal highlights that explain the predictions of a semantic divergence model via phrase alignment guided erasure. We show that the resulting highlights match human rationales of cross-lingual semantic differences better than popular post-hoc saliency techniques and that they successfully help people detect fine-grained meaning differences in human translations and critical machine translation errors.
more » « less
Controlling Pre-trained Language Models for Grade-Specific Text Simplification

https://doi.org/10.18653/v1/2023.emnlp-main.790

Agrawal, Sweta; Carpuat, Marine (January 2023, Association for Computational Linguistics)

Full Text Available
Bridging Background Knowledge Gaps in Translation with Automatic Explicitation

https://doi.org/10.18653/v1/2023.emnlp-main.603

Han, HyoJung; Boyd-Graber, Jordan; Carpuat, Marine (January 2023, Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing)

Translations help people understand content written in another language. However, even correct literal translations do not fulfill that goal when people lack the necessary background to understand them. Professional translators incorporate explicitations to explain the missing context by considering cultural differences between source and target audiences. Despite its potential to help users, NLP research on explicitation is limited because of the dearth of adequate evaluation methods. This work introduces techniques for automatically generating explicitations, motivated by WikiExpl: a dataset that we collect from Wikipedia and annotate with human translators. The resulting explicitations are useful as they help answer questions more accurately in a multilingual question answering framework.
more » « less

« Prev Next »

Search for: All records